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COLOR SELECTION SCHEME FOR DIGITAL VIDEO WATERMARKING 

Cross-Reference to Related Applications 

U.S. patent applications Serial No. (Zarrabizadeh 22) and Serial No. 
(Zarrabizadeh 23) were filed concurrently herewith. 

5 Technical Field 

This invention relates to the art of watermarking digital video, and more 
particularly, to selecting which chrominance portion should be watermarked. 

Background of the Invention 

Watermarking of video signals is, generally, the inclusion within the video itself 

10 of additional information. This can be useful to provide an embedded identification of 
the source of a video, to keep track of where and for how long a video is played, and to 
communicate information via the video to an ancillary device. Prior art techniques for 
watermarking video signals typically encoded the additional information in an analog 
format within the video itself using the luminance of the video to carry the additional 

15 information. However, the human visual system is very sensitive to the luminance signal, 
and so a person viewing a watermarked signal easily perceives distortion which is caused 
by the changes made to the video signal to convey the additional information when there 
is an attempt to increase the bit rate of the additional information beyond a certain point, 
e.g., beyond 120 bits per second. Thus, although the prior art's techniques of 

20 watermarking of video signals has had some success in certain applications, such success 
has been limited by the extremely small bit rate that is achievable without perceivable 
distortion by a person viewing the video signal carrying the additional information. 

In previously filed United States Patent Application Serial No. 10/342704, which 
is incorporated by reference as if set forth fully herein, I, along with my coinventor, 

25 recognized that the human visual system is much less sensitive to chrominance than to 
luminance. Therefore, we developed a system for digital watermarking a video signal 
that inserts the additional information of the watermarking signal on the chrominance 
component of the video signal rather than on its luminance signal. Thus, the additional 
information is "impressed" upon the chrominance component of the video signal. 

30 Advantageously, although there may be significant distortion of the chrominance 
component, especially when the additional information has higher bit rates than is 
achievable without perceivable distortion by the prior art, nevertheless such distortion 
will not be detected by the human visual system, provided it is appropriately managed. 
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Thus, the additional information can have a higher bit rate as compared with that 
achievable by the prior art, e.g., bit rates greater than 150 bits per second can be achieved. 
Further advantageously, the additional data can be recovered from the video signal even 
after the video signal watermarked with the additional data is compressed using the 

5 Motion Picture Expert Group (MPEG)-l and MPEG-2 encoding systems. 

The particular chrominance portion selected to carry the watermarking for any 
pixel is selected in United States Patent Application Serial No. 10/342704 by a color 
selection unit. The color selection unit determines the selected chrominance component 
as a function of the RGB and the YUV representations of the pixel using a prescribed 

10 formula. Since digital video is often transmitted only in YUV format, to use the system 
of United States Patent Application Serial No. 10/342704 with such YUV formatted 
video, it is necessary to develop therefrom the corresponding RGB formatted video. 
Disadvantageous^, to do so requires considerable processing power. Furthermore, 
although it is very good, it was later discovered that the mathematical model underlying 

15 the formula employed in the selection process of United States Patent Application Serial 
No. 10/3427 does not necessarily always produce flicker-free results. 

Summary of the Invention 

In accordance with the principles of the invention, the process of determining the 
chrominance portion to be watermarked may be improved by employing a 

20 perception-based table that indicates for various pixels which of the chrominance 
portions, if any, should be selected for watermarking. In accordance with an aspect of the 
invention, only values for Y, U, and V of a pixel may be required to access the table and 
determine which chrominance portion should be selected. Advantageously, when the 
digital video is in YUV format, the use of R, G ? and B is not required to select the 

25 chrominance portion, thereby reducing significantly the necessary processing power. In 
accordance with another aspect of the invention, the table may be represented such that it 
may be accessed using only R, G, and B values, so that there is no need to convert a 
source video in RGB format to YUV format in order to perform color selection. 

In one embodiment of the invention, the table is accessed by supplying Y, U, and 

30 V values, which may be decimated and/or quantized, and retrieving from the table an 
indication of whether U or V should be selected. In accordance with yet another aspect of 
the invention, the table may be modified so that it may indicate which of U or V should 
be selected, or that neither should be selected, e.g., when the color of the pixel is dark 
blue and/or dark purple, indicating that this pixel should not be watermarked at all. 



D:\PATENTS\Zarrabizadeh 24\Zarrabizadeh 24.doc 



Zarrabizadeh 24 

In accordance with another aspect of the invention, a mixed-mode of processing 
may be employed using the table and some processing. Advantageously, the table may be 
simplified, e.g., reduced by half its size, because a large section of the table may be 
replaced by a simple test on the pixel values, e.g., U<128, to determine the selected 
5 chrominance portion. 

Advantageously, the table may be changed on the fly without changing the 
underlying process, e.g., computer code, employed in the selection process. 

Brief Description of the Drawing 

In the drawing: 

10 FIG. 1 shows an exemplary transmitter for digital watermarking a video signal, in 

accordance with the principles of the invention; 

FIG. 2 shows an exemplary receiver for recovering the additional data of a video 
signal containing digital watermarking on the chrominance signal thereof, in accordance 
with the principles of the invention; 
15 FIGs. 3 A and 3B, when connected together as shown in FIG. 3, show an 

exemplary process for use in watermarking one of the chrominance portions with 
additional data, in accordance with the principles of the invention; 

FIGs. 4A and 4B, when connected together as shown in FIG. 4, show an 
exemplary process for extracting the additional information from a digitally watermarked 
20 video signal in which the additional information that constitutes the watermarking signal 
within the video signal has been impressed upon the chrominance component, in 
accordance with the principles of the invention; 

FIG. 5 shows an example of several safe ranges where the desired bit position is 
the third least significant bit; 
25 FIG. 6 shows an exemplary process for determining which particular chrominance 

portion is more suitable, and so should be selected, to contain the watermarking 
information for a pixel, in accordance with the principles of the invention; 

FIG. 7 shows a cutaway view of a portion of an exemplary divided colorspace; 
FIG. 8 shows another exemplary process by which the particular chrominance 
30 portion is selected to contain the watermarking information for a pixel, in accordance 
with the principles of the invention; 

FIG. 9 shows an exemplary transmitter arranged in accordance with the principles 
of the invention, in which flickering may be reduced by replicating the data to be 
impressed, at least once, and preferably two or more times, prior to its being impressed 
35 upon the average value of a chrominance portion of a block; and 
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FIG. 10 shows an exemplary embodiment of a receiver arranged in accordance 
with the principles of the invention for use in receiving a watermarked video signal such 
as that produced by the transmitter of FIG. 9. 

Detailed Description 

5 The following merely illustrates the principles of the invention. It will thus be 

appreciated that those skilled in the art will be able to devise various arrangements which, 
although not explicitly described or shown herein, embody the principles of the invention 
and are included within its spirit and scope. Furthermore, all examples and conditional 
language recited herein are principally intended expressly to be only for pedagogical 

10 purposes to aid the reader in understanding the principles of the invention and the 
concepts contributed by the inventor(s) to furthering the art, and are to be construed as 
being without limitation to such specifically recited examples and conditions. Moreover, 
all statements herein reciting principles, aspects, and embodiments of the invention, as 
well as specific examples thereof, are intended to encompass both structural and 

15 functional equivalents thereof. Additionally, it is intended that such equivalents include 
both currently known equivalents as well as equivalents developed in the future, i.e., any 
elements developed that perform the same function, regardless of structure. 

Thus, for example, it will be appreciated by those skilled in the art that any block 
diagrams herein represent conceptual views of illustrative circuitry embodying the 

20 principles of the invention. Similarly, it will be appreciated that any flow charts, flow 
diagrams, state transition diagrams, pseudocode, and the like represent various processes 
which may be substantially represented in computer readable medium and so executed by 
a computer or processor, whether or not such computer or processor is explicitly shown. 

The functions of the various elements shown in the FIGs., including any 

25 functional blocks labeled as "processors", may be provided through the use of dedicated 
hardware as well as hardware capable of executing software in association with 
appropriate software. When provided by a processor, the functions may be provided by a 
single dedicated processor, by a single shared processor, or by a plurality of individual 
processors, some of which may be shared. Moreover, explicit use of the term "processor" 

30 or "controller" should not be construed to refer exclusively to hardware capable of 
executing software, and may implicitly include, without limitation, digital signal 
processor (DSP) hardware, network processor, application specific integrated circuit 
(ASIC), field programmable gate array (FPGA), read-only memory (ROM) for storing 
software, random access memory (RAM), and non-volatile storage. Other hardware, 

35 conventional and/or custom, may also be included. Similarly, any switches shown in the 
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FIGS, are conceptual only. Their function may be carried out through the operation of 
program logic, through dedicated logic, through the interaction of program control and 
dedicated logic, or even manually, the particular technique being selectable by the 
implementer as more specifically understood from the context. 

5 In the claims hereof any element expressed as a means for performing a specified 

function is intended to encompass any way of performing that function including, for 
example, a) a combination of circuit elements which performs that function or b) software 
in any form, including, therefore, firmware, microcode or the like, combined with 
appropriate circuitry for executing that software to perform the function. The invention 

10 as defined by such claims resides in the fact that the functionalities provided by the 
various recited means are combined and brought together in the manner which the claims 
call for. Applicant thus regards any means which can provide those functionalities as 
equivalent as those shown herein. 

Software modules, or simply modules which are implied to be software, may be 

15 represented herein as any combination of flowchart elements or other elements indicating 
performance of process steps and/or textual description. Such modules may be executed 
by hardware which is expressly or implicitly shown. 

Unless otherwise explicitly specified herein, the drawings are not drawn to scale. 
In the description, identically numbered components within different ones of the 

20 FIGs. refer to the same components. 

FIG. 1 shows exemplary transmitter 101 for digital watermarking a video signal in 
accordance with the principles of the invention, by having one or more bits of watermark 
data carried via an average value of the chrominance component of each of various blocks 
of the video signal, on up to a per-frame basis. 

25 Shown in FIG. 1 are a) YUV demultiplexer (demux) and decimator 103, b) color 

selection 105, c) double-pole, double-throw switch 109, d) texture masking unit 111, 
e) multiplier 113, f) adder 115, g) multiplexer (mux) 117, h) bit mapper 123, and 
i) summer 133. Also shown in FIG. 1 are optional j) channel encoder 1 19, and k) block 
interleaver 121. 

30 YUV demultiplexer and decimator 103 receives a video signal to be watermarked, 

i.e., to have additional information added thereto. YUV demultiplexer and decimator 103 
may work with digital video, e.g., video formatted according to the Serial Digital 
Interface (SDI) standard. As will be recognized by those of ordinary skill in the art, any 
video signal not initially in an appropriate digital format may be converted thereto using 

35 conventional techniques. 
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YUV demultiplexer and decimator 103 demultiplexes the luminance (Y) 
component of the video and its chrominance component. The chrominance component of 
the video signal has two portions U and V, where U is the differential blue portion and V 
is the differential red portion. 

5 Much of the processing to embed the additional data on the chrominance 

component is, preferably, performed with a special decimated video format in which for 
each original 2x2 luminance block of video, had the original block been in 4-4-4 
representation, there remains only one Y, one U, and one V value. To this end, in the 
event the input video signal is actually in the so-called 4-4-4 format, the image is 

10 appropriately decimated by YUV demultiplexer and decimator 103 so that for each 
original 2x2 luminance block there is one Y, one U, and one V value. Similarly, in the 
event the input video signal is in the so-called "4-2-2" format, i.e., the luminance is full 
resolution while the chrominance portions are a) full resolution vertically only and b) half 
resolution horizontally, YUV demultiplexer and decimator 103 decimates the luminance 

15 component horizontally and vertically as well as decimates each chrominance portion 
only vertically. Likewise, in the event the input video signal is in the so-called 4-2-0 
format, i.e., the luminance component is full resolution while the chrominance portions 
are each only half resolution both vertically and horizontally, the luminance component of 
the image is decimated by YUV demultiplexer and decimator 103 so that for each 

20 original 2x2 luminance block had the original block been in 4-4-4 representation there 
remains only one Y, one U, and one V value. 

The preferred decimated video format may be supplied as an output to color 
selection 105. Thus, preferably, regardless of the format of the input video signal, further 
processing by the system preferably may be based on the decimated video signal such that 

25 for every 2x2 block of full resolution luminance pixels of the original input video signal 
there is one Y, one U, and one V value. Those of ordinary skill in the art will be able to 
develop their own methods, should they choose to do so, of developing one Y, one U, and 
one V value for every 2x2 block of luminance pixels. 

In order to know the format of the original video, a) an operator may indicate to 

30 YUV demultiplexer and decimator 103 the particular format of the video supplied to 
transmitter 101, b) the format of the video may be detected directly from the video using 
conventional techniques, or c)the information may be supplied from a higher layer 
processor which is supplying the input video signal. 

YUV demultiplexer and decimator 103 may also supply a second set of YUV 

35 outputs in the full format of the original input video signal to double-pole, double-throw 
switch 109. 
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Color selection 105 determines, for any particular pixel, on which portion of the 
chrominance component, i.e., on the U portion or the V portion, a change in value, if 
necessary, may be better accommodated without introducing a visible artifact. In one 
embodiment of the invention, color selection 105 is based upon a look-up table as 

5 described further hereinbelow. Alternatively, it may be based all or in part, on various 
computations, such as in prior United States Patent Application Serial No. 10/342,704. 

The output of color selection 105 is also used to control the position of 
double-pole, double-throw switch 109. More specifically, the output of color selection 
105 is set so that double-pole, double-throw switch 109 1) supplies, to adder 115, the 

10 portion of the chrominance component that has been selected to carry the watermark data; 
and 2) supplies, to YUV multiplexer 1 17, the portion of the chrominance component that 
was not selected. The output of color selection 105 is also supplied to multiplexer 117 
and to bit mapper 123 for use as described hereinbelow. 

Texture masking unit 1 1 1 analyzes the texture of the luminance area around each 

15 pixel in the decimated format supplied as output by YUV demux and decimator 103 to 
determine the maximum change in value that can be accommodated by that pixel without 
introducing visible artifacts, and supplies as an output a weight indicative thereof. The 
weight value may be coded, e.g., taking integral values from 1 to 5. Other values may be 
used, e.g., experiments have indicated that a value of up to 20 may be used in busy areas 

20 without visual degradation. The weight is supplied to multiplier 113. Texture masking 
unit 1 1 1 may put out a smaller value than the maximum distortion that can be introduced 
into a pixel as will be described hereinbelow. 

Note that the particular values used are at least partially dependent on the number 
of bits used to represent each Y, U, and V value. For example, the foregoing suggested 

25 weight values of 1 to 5, and a weight of even up to 20, are for Y, U, and V being 8 bit 
values. Those of ordinary skill in the art will readily recognize that the values employed 
for 8 bits may be scaled to 10 bits by multiplying by 4, e.g., shifting the value to the left 
two times. Likewise, other numbers of bits used for Y, U, and V can be similarly 
accommodated. 

30 Multiplier 113 multiplies the weight received from texture masking unit 1 1 1 by a 

value related to the information to be transmitted as part of this pixel, which is supplied 
by bit mapper 123. For example, the value supplied by bit mapper 123 may be -1, 0, or 1. 
The product produced by multiplier 1 13 is supplied to adder 1 15 and summer 133. 

Texture masking unit 111 is responsive to summer 133. In this regard, as noted, 

35 texture masking unit 1 1 1 may put out a smaller weight value than the change in value 
that can be introduced into a pixel in the event that it receives a signal to that effect from 
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summer 133. More specifically, summer 133 adds the values supplied by texture 
masking unit 111 for each block. Summer 133 supplies as an output to texture masking 
unit 111a maximum value that texture masking unit 1 1 1 can use as its output weight for 
the pixel currently being processed. The maximum value supplied by summer 133 is the 

5 lesser of the a) maximum weight value that can be accommodated by a pixel based on the 
texture surrounding it and b) the difference between a value supplied by bit mapper 123 
to summer 133 for the block and the current sum for the block. Thus, once the sum 
equals the value supplied by bit mapper 123 to summer 133 for the block, texture 
masking unit 1 1 1 outputs a zero for each remaining pixel of the block. 

10 Adder 115 produces a modified chrominance portion by adding the value supplied 

by multiplier 113 to the value of the portion of the chrominance which was selected by 
color selection 105 to carry the additional information for the pixel. As indicated, the 
portion of the chrominance that was selected by color selection 105 to carry the additional 
information is passed to adder 115 by double-pole, double-throw switch 109. The 

15 modified chrominance portion supplied by adder 1 15 is supplied to multiplexer 117. 

Texture masking unit 111, multiplier 113, bit mapper 123 and summer 133 
cooperate to effectively upsample the value being added to each pixel of the special 
processing resolution to match the format of the chrominance of the original video signal. 
To this end, the resulting upsampled values may be added to the selected chrominance 

20 portion of each pixel in the original video signal that corresponds to the location of a 
pixel in the special reduced resolution format used for processing. For example, if the 
original video signal is in 4-2-2 format, the values determined to be added to each of the 
pixels of a block in the special processing format are duplicated on a per-line basis so as 
to create a block of values to be added that has 8 pixels per line and 16 lines per block. In 

25 this block, each of the lines of the nonoverlapping groups of 2 consecutive lines has 
identical values to be added. Such a block corresponds in size to the original block of the 
selected chrominance portion of the original video in 4-2-2 format. Each value of the 
resulting upsampled block is added to the selected chrominance portion of the respective, 
like positioned pixel in the original video signal by adder 115. Those of ordinary skill in 

30 the art will readily be able to perform similar block conversions for different formats. 
Note that for those pixels of a block that color selection 105 did not determine that the 
selected chrominance portion could better accommodate a change, the value added will 
be zero. If the original video signal is in 4-2-0 format, no upsampling is required. 

In another embodiment of the invention, only the decimated special processing 

35 resolution format is processed. The resulting modified chrominance portion is then 
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upsampled, e.g., in multiplexer 117. However, doing so may result in some degradation 
of the original video signal, although such degradation need not be visible. 

Multiplexer 117 receives the original luminance component (Y) and the 
unmodified chrominance portion that was supplied from YUV demultiplexer and 

5 decimator 103 via double-pole, double-throw switch 109. Multiplexer 117 also receives 
the modified chrominance portion from adder 115. Multiplexer 117 then multiplexes 
together the original luminance component (Y), the unmodified chrominance portion, and 
the modified chrominance portion. Multiplexer 1 17 knows on which lead it receives the 
modified portion of the chrominance component and on which lead it receives the 

10 unmodified portion of the chrominance component by virtue of receiving the output of 
color selection 105. In accordance with an aspect of the invention, the resulting video 
signal is supplied as the watermarked output video signal. 

Those of ordinary skill in the art will be able to develop embodiments of the 
invention in which the additional data is added to the original chrominance signal portion 

15 rather than the decimated version thereof, so that upsampling will not be required. 

As indicated above, the binary data value, i.e., 1 or 0, of the additional 
information which is to be transmitted for each block may be supplied directly to bit 
mapper 123 for use as the watermark data or it may first be processed to facilitate the 
processing and recovery of the information at the receiver. Such exemplary processing 

20 may be performed by optional channel encoder 119 and block interleaver 121 . 

Channel encoder 119 receives the additional data that is desired to be embedded 
in the video stream. This data is then encoded, e.g., using a forward error correcting 
coding scheme. Such forward error correcting scheme may be any conventional forward 
error correcting scheme, such as convolutional encoding, e.g., Viterbi encoding or turbo 

25 encoding, or it may be any newly developed coding scheme. In one exemplary 
embodiment of the invention, convolutional coding of rate one-half is used. As a result 
of such coding, two bits are produced for every bit of the original bit stream. The channel 
encoded bit stream is supplied as an output by channel encoder 1 19 to block interleaver 
unit 121. 

30 Block interleaver 121 rearranges the order of the bits of the channel encoded bit 

stream in order to randomly distribute the data. Doing so helps reduce the chance that 
adjacent sections of the channel encoded bit stream are lost, e.g., due to bursts of noise or 
other factors, which would then make it difficult to recover such data at the receiver from 
the remaining, actually received data. In an exemplary embodiment of the invention, the 

35 number of bits that are interleaved as a unit is equal to the number of blocks in a frame. 
A block interleaver may be implemented by writing data sequentially to the rows of a 
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block left to right, at the end of each row starting again at the leftmost position of the next 
row down, and then reading the data by starting at the leftmost topmost position of the 
block and reading down a column until the end of the column is reached at which point 
reading continues at the top of the next column. A block interleaver of 45 rows by 30 

5 columns has proven effective for a picture size of 720 by 480 pixels. For different 
resolutions, those of ordinary skill in the art will be readily able to develop comparable 
block encoders. The interleaved channel encoded bit stream is supplied as an output by 
bit interleaver 121 to bit mapper 123. 

In accordance with an aspect of the invention, the data bit supplied by block 

10 interleaver 121 is impressed as the watermark data, under the control of bit mapper 123, 
upon at least one block of at least one frame of the original video signal. In accordance 
with the principles of the invention, bit mapper 123 controls the insertion of the 
watermark data into one of the bit positions of the average value of at least a selected one 
of the chrominance portions of each block upon which the data is to be impressed, thus 

1 5 effectively replacing the bit at that bit position. 

For example, when the watermark data is to be carried in the least significant bit 
of the integer portion of the average of the selected chrominance portion of the block, the 
value that needs to be added to the average value is 0 or 1 . Zero is added when the least 
significant bit of the integer portion of the average value is already the same as the 

20 watermark data bit to be carried and 1 is added when the least significant bit of the integer 
portion of the average value is the complement of the watermark data bit to be carried. 
When the watermark data is to be carried in the second to the least significant bit of the 
integer portion of the average of the selected chrominance portion of the block, the value 
of the data to be added to the pixel is -1, 0, or 1. Zero is added when the second least 

25 significant bit of the integer portion of the average value is already the same as the 
watermark data bit to be carried and 1 or -1 is added when the second least significant bit 
of the integer portion of the average value is the complement of the watermark data bit to 
be carried. Whether 1 or -1 is added depends on which will cause the smallest change to 
the average value while changing the second least significant bit of the integer portion of 

30 the average value to its complement. Using the second to least significant bit the data to 
be embedded is more likely to survive encoding by MPEG or a similar process. When 
the data to be placed in the third to the least significant bit of the integer portion of the 
average of the selected chrominance portion of the block, the value of the data to be 
added to the pixel is -2, -1, 0, 1, or 2. Zero is added when the third least significant bit of 

35 the integer portion of the average value is already the same as the watermark data bit to be 
carried and is -2, -1, 1, or 2 is added when the third least significant bit of the integer 
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portion of the average value is the complement of the watermark data bit to be carried. 
Whether is -2, -1, 1, or 2 is added depends on which will cause the smallest change to the 
average value while changing the third least significant bit of the integer portion of the 
average value to its complement. Using the third to least significant bit the data to be 

5 embedded is even more likely to survive encoding by MPEG or a similar process to 
achieve adequate results. From the foregoing, those of ordinary skill in the art will 
readily be able to determine the values to be added for more significant bit positions 
which are determined by the user or the system. 

To this end, bit mapper 123 develops a value that is distributively added to a 

10 selected chrominance portion of the pixels of a block such that doing so changes the 
average of the value of that chrominance portion for that block so that the bit supplied by 
block interleaver 121 that is being impressed is placed in a selected bit position of the 
average value of the selected chrominance portion. This value is the value to be added to 
the average value of the selected chrominance portion to place the watermark data bit in 

15 the appropriate bit position multiplied by the number of pixels in a block. In other words, 
the value developed by bit mapper 123 that is to be added to the average of the value of 
that chrominance portion is divided up into smaller values that are added to individual 
pixels of the block, so that the total of the smaller values added to the block divided by 
the number of pixels in the block equals the value to be added to the average value of the 

20 selected chrominance portion. 

The particular bit average of the value of the chrominance portion for that block, 
e.g., the DC coefficient for that chrominance portion, onto which the data supplied by bit 
mapper 123 is impressed, is determined by bit mapper 123. In an exemplary embodiment 
of the invention, the second least significant bit of the DC coefficient for a block is 

25 replaced with the particular value that is desired to be impressed on the block. In another 
embodiment of the invention, which bit of the DC coefficient that is replaced may be a 
function of the texture variance of the block. It is advantageous to increase the 
significance of the bit which is replaced as the texture variance increases, because the 
MPEG coding standards employ greater quantization step sizes for higher texture 

30 variances, and the use of such greater quantization step sizes could filter out the 
watermark data bit if it is positioned in a bit position that is not significant enough. When 
using more significant bits, the values to be added or subtracted from the DC coefficient 
in order to change the bit being substituted to its complementary value may be greater 
than one. To this end, in accordance with an aspect of the invention, bit mapper 123 

35 receives the average variance of the luminance component for the block from texture 
masking 111, and based on the average variance, determines which bit position is to be 
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replaced. The greater the variance, the more significant the bit position into which the 
watermark data is placed. 

Bit mapper 123 supplies the data bit from the interleaved channel encoded bit 
stream that is to be communicated for each block of the original video signal at the 

5 appropriate time for each pixel of the block of the original video signal when that pixel is 
to be incorporated into the watermarked output video signal. Thus, bit mapper 123 takes 
into account the fact that the processing of the video signal is line based, i.e., the 
processing is left to right on a line, then down to the next line and left to right again, 
causing the adjacent pixels of a block to not necessarily be located sequentially in the 

10 video stream and therefore to not all be processed in time directly one after the other. The 
particular data bit supplied as an output of bit mapper 123 at any time is supplied as an 
input to multiplier 113. 

Using an encoder, such as shown in FIG. 1, a bit rate of around 6,750 bits per 
second, substantially error free, has been achieved for the additional information as 

15 supplied to channel encoder 119 when the video frame size is 720 x 480 pixels. 

Those of ordinary skill in the art will readily recognize from the above description 
that various ones of the units in FIG. 1 require storage in order to first determine the 
values which must be computed using information from an entire block, e.g., the original 
average value of the block and the average texture variance of the block, and then to 

20 employ those values in processing the individual pixels. Consequently, there is typically 
a one slice delay, where a slice is a strip of blocks horizontally all the way across a frame. 

FIG. 2 shows exemplary receiver 201 for recovering the additional data of a video 
signal containing digital watermarking on the chrominance signal thereof, in accordance 
with the principles of the invention. Shown in FIG. 2 are a) YUV demultiplexer (demux) 

25 and decimator 203, b) color selection unit 207, c) double pole double throw switch 209, 
d) block variance calculation 211, e) block integrator V 213, f) block integrator U 215, 
g) bit selection 217, h) deinterleaver 219, and i) channel decoder 221 . 

YUV demultiplexer and decimator 203, which may be substantially the same as 
YUV demultiplexer and decimator 103 of transmitter 101 (FIG. 1), receives a video 

30 signal that has been digitally watermarked in that additional information has been added 
thereto on the chrominance component of the signal, in accordance with the principles of 
the invention. YUV demultiplexer and decimator 203 works with digital video, e.g., 
formatted according to the serial digital interface (SDI). As will be recognized by those 
of ordinary skill in the art, any video signal not initially in an appropriate digital format 

35 may be converted thereto using conventional techniques. 
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YUV demultiplexer and decimator 203 demultiplexes the luminance (Y) 
component of the video and its chrominance component and decimates it to the preferred 
processing format in which for each original 2x2 luminance block of video, had the 
original block been in 4-4-4 representation, there remains only one Y, one U 5 and one V 

5 value. In order to know the format of the received video, a) the operator needs to indicate 
to YUV demultiplexer and decimator 203 the particular format of the input video, b) the 
format of the video may be detected directly from the video using conventional 
techniques, or c) the information may be supplied from a higher layer processor which is 
supplying the input video signal. The demultiplexed luminance and chrominance 

10 components are supplied to color selection 207. In addition, the luminance component is 
supplied to block variance calculation 211, the V chrominance portion is supplied to 
block integrator V 213, and the U chrominance portion is supplied to block 
integrator U 215. Unlike YUV demultiplexer and decimator 103, YUV demultiplexer 
and decimator 203 need not also supply a second set of YUV outputs in the full format of 

1 5 the original input video signal. 

Color selection unit 207 determines for each block on which portion of the 
chrominance component, i.e., on the U portion or the V portion, it was likely that the 
additional information was embedded. The output of color selection unit 207 is used to 
control the position of double pole double throw switch 209. More specifically, color 

20 selection unit 209 selects the chrominance portion U or V, as a function of Y, U, and V, 
as will be described in more detail hereinbelow, on which the additional information was 
likely to have been embedded for this block. In one embodiment of the invention, color 
selection unit 207 is based on a lookup table. Doing so simplifies the process by avoiding 
the need for YUV to RGB conversion, which might otherwise be necessary. 

25 Note that the input to color selection unit 207 is individual pixels. Color selection 

unit 207 keeps track of the pixels in each block and combines the individual U or V 
selection for each pixel in the block. The particular component that has the highest value, 
i.e., was most often selected for the pixels within a block, is determined to be the output 
of color selection 207. The output of color selection unit 207 is then set so that switch 

30 209 supplies to bit selection 217 the integrated version of the portion of the chrominance 
component to which the additional data was determined to have been added. 

Block variance calculation 21 1 determines the particular bit of the average of the 
value of the selected chrominance portion for that block, e.g., the DC coefficient for the 
selected chrominance portion, that likely contains the impressed data. As noted, in an 

35 exemplary embodiment of the invention, bit mapper 123 (FIG. 1) received and employed 
the average of the variances of the luminance component of the pixels of the block, to 
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determine which bit position is to be replaced with the watermark data bit to be 
impressed. The greater the variance, the more significant the bit position that should be 
replaced. Block variance calculation 21 1 (FIG. 2) should base its calculation on the same 
information used by mapper 123 to replicate its determination. The output of block 

5 variance calculation 2 1 1 is supplied to bit selection 217. 

Block integrator V 213 integrates the values of V over a block, i.e., the values for 
each pixel in a block are combined, e.g., added together. Block integrator U 215 
integrates the values of U over a block, i.e., the values for each pixel in a block are 
combined, e.g., added together. 

10 Bit selection 217 extracts the bit at the bit position specified by block variance 

calculation 211 from the integrated chrominance portion value supplied to it by switch 
209 as the data for the block. 

Deinterleaver 219 reorders the data to undo the effect of block interleaver 121 
(FIG. 1) of transmitter 101. The reordered values are then supplied to channel decoder 

15 221 (FIG. 2), which performs appropriate decoding for a signal that was encoded using 
the type of encoding employed by channel encoder 119 of transmitter 101 (FIG. 1). The 
resulting decoded values are supplied by channel decoder 221 (FIG. 2) as the 
reconstructed version of the additional data signal. For further robustness, channel 
decoder 221 may be a so-called "sequence decoder", e.g., a turbo decoder. 

20 FIGs. 3A and 3B, when connected together as shown in FIG. 3, show an 

exemplary process for use in watermarking one of the chrominance portions with 
additional data, in accordance with the principles of the invention. For those blocks 
where the determined bit position is already the same as the value to be impressed, the 
block may be transmitted unmodified. The process of FIG. 3 may be performed, in an 

25 exemplary embodiment of the invention, in a system such as is shown in FIG. 1 . 

The process may be entered in step 301 when all the pixels of a block are 
available. Part of the processing of FIG. 3 takes place on a block-by-block basis, and part 
on a pixel-by-pixel basis. The blocks of a frame are indexed using a two-dimensional 
pointer p,q, where p points to the particular horizontal slice of the frame that is being 

30 processed and q points to the particular column, or vertical slice, of the frame. For 
example, for 720x480 resolution p ranges between 1 and 30 and q between 1 and 45. 
Similarly, the pixels of each block are indexed using a two-dimensional pointer ij, where 
i points to the particular row within the block that is being processed and j points to the 
particular column within the block that is being processed. For example, in the special 

35 processing mode employed to impress the data, where each macroblock of original video 
has only a corresponding 8x8 block of Y, U, and V, both / and j range between 0 and 7. 
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After entering the process in step 301 , several variables that are used in the 
process are initialized in step 303, e.g., countU(p,#)=0, count V(/?,#)=0, sumU(p,#)=0, 
sumV(p,#)=0, and var(p,^)=0. CountU is a running total of how many pixels within the 
block are selected by the color selection process as being suitable for watermarking on the 

5 U chrominance portion while count V is a running total of how many pixels within the 
block are selected by the color selection process as being suitable for watermarking on the 
V chrominance portion. SumU and sumV are the running total values of U and V 
respectively over all the pixels of the block. In embodiments of the invention where 
watermarking is only performed only on pixels of the chrominance portion selected for 

10 the block, there is no use for the one of sumU and sumV that is developed for the 
chrominance portion that is not selected. 

In step 305, var(##), the total of the variance of the luminance for each individual 
pixel within the block, which is, of course, proportional to the average variance of the 
luminance for the block, is computed. To this end, i and j are initially both set to point to 

15 the first pixel of the block to be processed, e.g., i=0 and y=0. The value of var(##), is 
computed by cycling through each pixel of the block, changing the values of i and j as 
appropriate to do so, and adding together the variance of the luminance for each pixel to 
the current total of var(p,q). 

In one embodiment of the invention, the variance of the luminance for any 

20 particular pixel may computed by taking the absolute value of the difference in the 
luminance between the pixel and all of its nearest neighbors. Mathematically, where all 
of the nearest neighbors are within the same block, this may be written as 
varfo 0=vai<p, q)+Q Y™ - H W - «) H W " «) H W " Y ijf H Y ijf " 

y(/>>9) i,| y(P»?) y(P^) ■ . | y(p,i) y(P,<i) 1. 1 y(P><i) y(P^) i . | yip^) i\ 
^I+IJ+OH J (IJ) mI (i+\jy n Z (/,7) ~ I (iJ+iV i J iU) mI {i-U+l)~* £ (U) 

25 Those of ordinary skill in the art will readily be able to adapt the foregoing to 

those pixels whose nearest neighbors are in other blocks. Furthermore, for those blocks 
that are near the borders of the frame, and hence have no nearest neighbors, or the nearest 
neighbors are part of those blocks that are not displayed, the value of such neighbors may 
be considered to be zero. 

30 In accordance with another aspect of the invention, not all of a pixel's nearest 

neighbors need be considered in the variance computation and yet sufficiently high 
quality results can be achieved. More specifically, it is advantageous in that computation 
time for each pixel is reduced by taking only the differences of the 4 pixels in the corners 
of the rectangle surrounding the pixel and 2 of the other pixels that form a vertical or 

35 horizontal line with the pixel, e.g., the 2 pixels on the horizontal line with the pixel. 
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Thereafter, conditional branch point 307 tests to determine which particular 
chrominance portion, i.e., U or V, is going to contain the watermark information for the 
block. This is done by evaluating the color selection for each pixel in the block and 
counting the number of pixels within the block that are selected for each chrominance 
5 portion. The chrominance portion that was selected the most for the block is chosen for 
watermarking. Note that in some embodiments of the invention, it may be determined 
that a particular pixel is unsuitable for watermarking at all. In such a case, it is not 
counted towards the total number of pixels for either U or V. 

The particular method of determining the color selected to be watermarked for 
10 each pixel is at the discretion of the implementer. In one embodiment of the invention, 
the chrominance portion of the pixel with the smallest value is selected. In another 
embodiment of the invention, the color selection arrangement described hereinbelow is 
employed. 

Next, the bit position of the average value of the selected chrominance portion 
15 that will contain the watermarked bit is determined. The bit position is selected so that 

the watermarked bit will survive any subsequent quantization, such as takes place in 

MPEG-like encoding. 

To this end, if the test result in step 307 is that the V chrominance portion is 

selected to be watermarked, control passes to step 309, in which a variable 
20 watermarkcolor is set equal to V. Thereafter, conditional branch point 323, which tests 

to determine whether the average Y variance over the block, var(p,q), is greater than a 

first prescribed V threshold tlv, which is the largest V threshold. An exemplary value of 

tlvis 600. 

Note that the particular threshold values used in connection with FIGs. 3 and 4 for 
25 both U and V are at least partially dependent on the number of bits used to represent each 

Y value, when the average Y variance is compared with the suggested threshold. For 
example, the suggested threshold values herein are for Y being an 8 bit value. Those of 
ordinary skill in the art will readily recognize that the values employed for 8 bits may be 
scaled to 10 bits by multiplying by 4, e.g., shifting the value to the left two times. 

30 Likewise, other numbers of bits used for Y, U, and V can be similarly accommodated. 

In other embodiments of the invention, instead of using the average Y variance 
over the block for the various comparisons, a different average variance, e.g., the average 

V variance over the block, may be calculated and employed. 

If the test result in step 323 is YES, indicating that the variance is large enough 
35 that the additional data should be encoded on the 5 th least significant bit of the average of 
the V values of the pixels of the block, e.g., the value of int[sumV(p,q)/(number of pixels 
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per block)], e.g., int[sumV(p,q)/64], is greater than tlv, control is passed to step 325, in 
which a variable m is set equal to 5. 

Note that instead of using the integer function int for rounding, as is used herein, 
any other form of rounding to achieve an integer value may be employed, e.g., always 

5 rounding up or always rounding to the nearest integer value. 

If the test result in step 323 is NO, indicating that the variance was not large 
enough that additional the data should be encoded on the 5 th least significant bit of the 
average value of the V values of the pixels of the block, control passes to conditional 
branch point 329, which tests to determine if the average Y variance over the block, 

10 vax(p,q), is greater than a second prescribed V threshold, t2v, which is the second largest 
V threshold. An exemplary value of t2v is 15. 

If the test result in step 329 is YES, indicating that the additional data should be 
encoded on the 4 th least significant bit of the average of the V values of the pixels of the 
block, control is passed to step 331, in which variable m is set equal to 4. 

15 If the test result in step 329 is NO, indicating that the variance was not large 

enough that the additional data should be encoded on the 4 th least significant bit of the 
average of the V values of the block, control passes to conditional branch point 333, 
which tests to determine if the average Y variance over the block, var(p,g), is greater than 
a third prescribed V threshold, t3v, which is the smallest V threshold. An exemplary 

20 value of t3v is 7. 

If the test result in step 333 is YES, indicating that the variance is large enough 
that the data should be encoded on the 3 rd least significant bit of the average of the V 
values of the pixels of the block, control is passed to step 335, in which variable m is set 
equal to 3. 

25 If the test result in step 333 is NO, indicating that the variance is only large 

enough that the data should be encoded on the 2 nd least significant bit of the average 
value of the V value of the block control is passed to step 337, in which variable m is set 
equal to 2. 

If the test result in step 307 is that the U is the chrominance portion is selected to 
30 be watermarked, control passes to step 311, in which the variable watermarkcolor is set 
equal to U. Thereafter, conditional branch point 343 tests to determine whether the 
average Y variance over the block, vai(p,q), is greater than a first prescribed threshold 
tlu, which is the largest threshold. An exemplary value of tlu is 600. 

In other embodiments of the invention, instead of using the average Y variance 
35 over the block for the various comparisons, the average U variance over the block may be 
calculated and employed. 
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If the test result in step 343 is YES, indicating that the variance is large enough 
that the data needs to be encoded on the 5 th least significant bit of the average of the U 
values of the pixels of the block, e.g., the value of int[sumV(p,q)/(number of pixels per 
block)], e.g., int[sumU(p,q)/64], is greater than tlu, control is passed to step 345, in 
5 which variable m is set equal to 5. 

Note that instead of using the integer function int for rounding herein, any other 
form of rounding to achieve an integer value may be employed, e.g., always rounding up 
or rounding to the nearest integer value. 

If the test result in step 343 is NO, indicating that the variance was not large 
10 enough that the data needed to be encoded on the 5 th least significant bit of the average of 
the U values of the pixels of the block, control passes to conditional branch point 349, 
which tests to determine if the average Y variance over the block, var(/?,#), is greater than 
a second prescribed threshold t2u, which is the second largest U threshold. An exemplary 
value oft2u is 15. 

15 If the test result in step 349 is YES, indicating that the data needs to be encoded 

on the 4 th least significant bit of the average of the U values of the pixels of the block, 
control passes to step 351, in which variable m is set equal to 4. 

If the test result in step 349 is NO, indicating that the variance was not large 
enough that the data should be encoded on the 4 th least significant bit of the average of 

20 the U value of the pixels of the block, control passes to conditional branch point 353, 
which tests to determine if the average Y variance over the block, var(p,#), is greater than 
a third prescribed threshold t3u, which is the smallest U threshold. An exemplary value 
of t3u is 7. 

If the test result in step 353 is YES, indicating that the variance is large enough 
25 that the data should be encoded on the 3 rd least significant bit of the average of the U 
values of the pixels of the block, control passes to step 355, in which variable m is set 
equal to 3. 

If the test result in step 353 is NO, indicating that the variance is only large 
enough that the data should be encoded on the 2 nd least significant bit of the average of 
30 the U values of the pixels of the block, control is passed to step 357, in which variable m 
is set equal to 2. 

Once the particular bit of the average value over the block of the selected 
chrominance portion to be employed to contain the watermarked data is determined, the 
process to make certain that that bit position contains the desired bit is undertaken. The 
35 goal of the process is to add or subtract the minimum possible value from the current 
average value of the selected chrominance portion to make certain that the desired bit 
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position has the value of the watermarking bit to be transmitted. Note that, in one 
embodiment of the invention, the desired bit position is a bit position within the integer 
portion of the average value. To this end, ideally, if the desired bit position already 
contains the value of the watermarking bit to be transmitted, nothing may be added to the 

5 current average value of the selected chrominance portion. On the other hand, if the 
desired bit position contains the complement of the value of the watermarking bit to be 
transmitted, ideally, only the smallest possible value that will flip the desired bit position 
to its complement by being either added to or subtracted from the desired bit position, and 
hence causing the least change in the value of the average value of the selected 

10 chrominance portion from its current unwatermarked value to its final watermarked 
value, is added to or subtracted from the desired bit position as appropriate. 

In practice, due to quantization noise, rounding as part of the inventive process, 
and other factors of the MPEG-like encoding process that may impact the final value of 
the desired bit, a slightly different value may be added or subtracted as explained further 

15 herein. More specifically, in one embodiment of the invention, a "safe" range of values 
having the desired bit value at the desired bit position is selected, and the minimum value 
is either added or subtracted to the average value of the selected chrominance portion so 
that the final value has the desired bit value at the desired bit position and it is within the 
safe range. Thus, typically, whenever a bit of the average value needs to be changed to its 

20 complement to carry the watermark data, the resulting value is always at the border of a 
safe range. When the value at the desired bit position is already the value of the 
watermark data bit to be transmitted, if the average value of the selected chrominance 
portion is already within the safe range, then nothing needs to be added to the average 
value of the selected chrominance portion. However, when the average value of the 

25 selected chrominance portion is not already within the safe range, then the minimum 
value necessary to change the average value of the selected chrominance portion to be a 
value within the safe range, while keeping the value of the desired bit position at the 
value of the watermarking bit to be transmitted, is added to, or subtracted, from the 
average value of the selected chrominance portion. 

30 Conceptually, the foregoing may be thought of as first adding or subtracting the 

minimum value to achieve the desired watermarking value at the desired bit position, and 
then adding or subtracting a further amount, e.g., a margin value, to insure that the final 
value is within the safe range. 

FIG. 5 shows an example of several safe ranges where the desired bit position is 

35 the third least significant bit. Along the axis are shown the average value of the selected 
chrominance portion 
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Table 1 shows (code) (table of values) 

Upon completion of steps 325, 331, 335, 337, 345, 351, 355 and 357, control 
passes to conditional branch point 361, which tests to determine if the bit of 
watermarking data to be impressed on the block is the same as the current identified bit 

5 position for the average value of the chrominance portion identified by the variable 
watermarkcolor. If the test result in step 361 is YES, indicating that the bit of 
watermarking data to be impressed on the block is the same as the current identified bit 
position for the average value of the chrominance portion identified by the variable 
watermarkcolor, and that therefore the bit does not need to be changed to its 

10 complementary value, control passes to step 363, which tests to determine if the value is 
within the safe range for the current bit position. If the test result is NO, indicating that 
an error might be introduced during subsequent processing, control passes to step 365, 
which sets the variable changevalue to be equal to the value needed to move the current 
average value for the color indicated by watermarkcolor into the nearest safe range 

15 without changing the value of the desired bit position. Note that the value need not be an 
integer value, and it may also be a negative value. If the test result in step 363 is NO, 
indicating that the current average value for the color indicated by watermarkcolor is 
already within a safe range, control passes to step 367, and the value of changevalue is set 
equal to zero. 

20 If the test result in step 361 is NO, indicating that the bit of watermarking data to 

be impressed on the block is not the same as the current identified bit position for the 
average value of the chrominance portion identified by the variable watermarkcolor, and 
that therefore the value of the bit must be changed to its complementary value so as to 
properly carry the watermarking data, control passes to step 369, which tests to determine 

25 if the nearest safe range for the current bit position is greater or smaller than the current 
average value of the color indicated by watermarkcolor. If the test result in step 369 is 
GREATER, indicating that the values of the nearest safe range for the current bit position 
is greater than the current average value of the color indicated by watermarkcolor, control 
passes to step 371 in which the value of variable changevalue is set equal to the smallest 

30 value to add to the average value so that the resulting value is within the adjacent safe 
range with bigger values. Note that this value need not be an integer value. If the test 
result in step 369 is SMALLER, indicating that the values of the nearest safe range for the 
current bit position is smaller than the current average value of the color indicated by 
watermarkcolor, control passes to step 373 in which the value of variable changevalue is 

35 set equal to the smallest negative value that when added to the average value results in a 
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value that is within the adjacent safe range with smaller values. Again, note that this 
value need not be an integer value, and it may also be a negative value. 

Upon conclusion of step 365, 367, 371, or 373, control passes to step 375 in 
which the total to add to the pixels is set equal to the product of the number of pixels per 

5 block and the value of changevalue. If the resulting product value is not an integer, the 
value is rounded off. The rounding may be performed in a manner consistent with the 
steps 365, 371, and 373, in that if a negative value was added, the rounding is down by 
taking the integer portion of the value, while if a positive value was added the rounding is 
up toward the next whole integer value. 

10 Processing now changes from a per-block level to a per-pixel level within the 

block. In step 377, the first pixel of the block is pointed to. Thereafter, conditional 
branch point 379 tests to determine if the current pixel is to be watermarked, based on its 
color. This is done by determining if the chrominance component of this pixel that is 
suitable for watermarking is the same as the color selected in step 307 for the entire 

15 block. If the test result in step 379 is YES, indicating that this pixel should be 
watermarked, control passes to step 381, in which a value is added to the current pixel 
based on the luminance variance for the pixel and the total values added so far to the 
pixels of the block. 

More specifically, a maximum value that can be added to the pixel without 
20 introducing a visible artifact is determined as a function of the variance of the luminance. 
The greater the variance of the luminance, the greater the value that can be added, up to a 
prescribed maximum. Note that this value may be positive or negative. This value is 
then added to pixel if the total to be added to the pixels is a positive value, or the value is 
subtracted from the pixel if the total to be added to the pixels is a negative value. 
25 However, as the per-pixel processing proceeds running total of the values added or 
subtracted are subtracted from the total to be added to the pixels. If the value to be added 
to the current pixel will make the difference between the total to be added to the pixels 
and the running total cross zero, then the value is adjusted so that the running total just 
equals zero. 

30 If the test result in step 379 is NO, or after completing step 381, control passes to 

conditional branch point 383, which tests to determine if the current pixel is the last pixel 
of the block. If the test result in step 383 is NO, control passes to step 385 which tests to 
determine if the total to be added to the pixels of the block has already been added, i.e., is 
the running total equal to the total to be added to the pixels of the block. If the test result 

35 in step 385 is NO, indicating that there is more that needs to be added to the pixels of the 
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block, control passes to step 387, which points to the next pixel of the block. Control 
then passes back to step 379, and the process continues as described above. 

If the test result in either of steps 383 or 385 is YES, indicating that either all the 
pixels of the block have been processed or all of the total that need to be added has been 

5 added, control passes to step 389 and the process is exited. 

FIGs. 4A and 4B, when connected together as shown in FIG. 4, show an 
exemplary process for extracting the additional information from a digitally watermarked 
video signal in which the additional information that constitutes the watermarking signal 
within the video signal has been impressed upon the chrominance component, in 

10 accordance with the principles of the invention. Such a process may be implemented by 
an exemplary embodiment of the invention, such as the one shown in FIG. 2, across color 
selection 207, double pole double throw switch 209, block variance calculation 211, 
block integrator V 213, block integrator U 215 and bit selection 217 (FIG. 2). 

The process is entered in step 401 (FIG. 4) when a new block of the received 

15 decimated frame is to be processed. Note that for pedagogical purposes it is assumed 
herein that pixels are supplied for processing by the process of FIG. 4 grouped by block, 
so that all the pixels of a block are processed prior to any pixels of the next block being 
processed. However, in designing an actual system, those of ordinary skill in the art will 
readily recognize that the pixels may be processed in the same order that they are scanned 

20 and that appropriate memory locations and control structures may be used so as to 
effectively separately process the blocks. 

Part of the processing of FIG. 4 takes place on a block-by-block basis, and part on 
a pixel-by-pixel basis. The blocks of a frame are indexed using a two-dimensional 
pointer p,q, where p points to the particular horizontal slice of the frame that is being 

25 processed and q points to the particular column, or vertical slice, of the frame. For 
example, for 720x480 resolution, p ranges between 1 and 30 and q between 1 and 45. 
Similarly, the pixels of each block are indexed using a two-dimensional pointer ij, where 
i points to the particular row within the block that is being processed and j points to the 
particular column within the block that is being processed. For example, in the special 

30 processing mode employed to impress the data, where each macroblock of original video 
has only a corresponding 8x8 block of Y, U, and V, both i and j range between 0 and 7. 

After entering the process in step 401, several variables that are used in the 
process are initialized in step 403, e.g., countU(p,?)=0, countV(p,qr)=0 s sum\J(p,q)=Q 9 
sumV(p,#)=0, and vai(p,q)=0. CountU and countV are a running total of how many 

35 pixels within the block were selected by the color selection process as being U and V, 
respectively, while sumU and sumV are the running total values of U and V, respectively, 
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over all the pixels of the block. For the block, i and j are both set to point to the first 
pixel of the block to be processed, e.g., i=0 and j=0 as well. For each block, var(p,q) 
represents the total of the variance of the luminance for each individual pixel within the 
block, which is, of course, proportional to the average variance of the luminance for the 
5 block. 

Thereafter, in step 405, the Y, U and V values for the currently pointed to pixel of 
the currently being processed block is obtained, e.g., the values of Y^jf.U^f, and 

V { ] P jf are obtained. The current values of U and V are added to the respective current 

values of sumU and sumV in step 407. Also in step 407 the variance of the luminance, 
10 vzr(p,q\ is updated by adding the variance of the luminance for the current pixel to the 
current total of var(p,</). In one embodiment of the invention, the variance of the 
luminance for the current pixel may computed by taking the absolute value of the 
difference in the luminance between the current pixel and all of its nearest neighbors. 
Mathematically, where all of the nearest neighbors are within the same block this may be 
15 written as 

var(p, q)=vMp, q)H\ - «-) H Y ijf " #tf> H W " Ku% M Y ij? " W M Y ijf ' 

Those of ordinary skill in the art will readily be able to adapt the foregoing to 
those pixels whose nearest neighbors are in other blocks. Furthermore, for those blocks 

20 that are near the borders of the frame, and hence have no nearest neighbors, or the nearest 
neighbors are part of those blocks that are not displayed, the value of such neighbors may 
be considered to be zero. 

In accordance with another aspect of the invention, not all of the nearest neighbors 
need be considered and yet sufficiently high quality results can be achieved. More 

25 specifically, it is advantageous in that computation time is reduced to take the differences 
of the 4 pixels in the corners of the rectangle surrounding the current pixel and 2 of the 
other pixels that form a vertical or horizontal line with the current pixel, e.g., the 2 pixels 
on the horizontal line with the current pixel. However, the decoder should match the 
same process that was employed in the encoder. 

30 Control passes to conditional branch point 409, which tests to determine on which 

of U or V it was likely that the additional data was impressed. The details of this 
determination will be described further hereinbelow. If the test result in step 409 is U, 
indicating that the additional data was most likely impressed on U for the current pixel, 
control passes to step 411, in which countU is incremented. Control then passes to step 

35 413. If the test result in step 409 is V, indicating that the additional data was most likely 
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impressed on V for the current pixel, control passes to step 415, in which countV is 
incremented. Control then passes to step 413. 

In some embodiments of the invention, conditional branch point 409 may be a 
three-way test, with an additional result indicating that it is likely that data was not 

5 impressed on the pixel at all, i.e., not on the U or the V. If such is the result, control 
simply passes directly to step 413. 

Conditional branch point 413 tests to determine if the current pixel is the last pixel 
of the current block. If the test result in step 413 is NO, indicating that there remains 
additional pixels in the current block that have yet to be processed, control passes to step 

10 417, in which the values of i and j are adjusted to point to the next as-of-yet-not- 
processed pixel. Control then passes back to step 405 and the process continues as 
described above. If the test result in step 413 is YES, indicating that all the pixels of the 
current block have been processed, control passes to step 419, in which the variance of 
the decimated luminance for the block is calculated, i.e., the variance of the 8x8 Y block 

15 is calculated. 

Control then passes to conditional branch point 421, which tests to determine if 
countV>countU for the current block. If the test result in step 421 is that countV is 
indeed greater than countU, control passes to conditional branch point 423, which tests to 
determine whether the average Y variance over the block, var(p,<7), is greater than a first 
20 prescribed threshold tlv, which is the largest V threshold. An exemplary value of tlv is 
600. 

In other embodiments of the invention, instead of using the average Y variance 
over the block for the various comparisons, the average U or the average V variance over 
the block may be calculated and employed, e.g., whichever has the greater count value. 

25 If the test result in step 423 is YES, indicating that the variance is large enough 

that the data was likely to have been encoded on the 5 th least significant bit of the integer 
portion of the average of the V values of the pixels of the block, e.g., the value of 
int[sumV(p,q)/(number of pixels per block)], e.g., int[sumV(p,q)/64], control is passed to 
step 425, in which a variable m is set equal to 5. Control then passes to step 427, in 

30 which the value of the m** 1 least significant bit of the average of the V values of the pixels 
of the block is extracted as the value impressed upon this block. The process is then 
exited in step 459. 

Note that instead of using the integer function int for rounding herein, any other 
form of rounding to achieve an integer value may be employed, e.g., always rounding up 
35 or rounding to the nearest integer value. 
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If the test result in step 423 is NO, indicating that the variance was not large 
enough that the data was likely to have been encoded on the 5 th least significant bit of the 
integer portion of the average of the V values of the pixels of the block, control passes to 
conditional branch point 429, which tests to determine if the average Y variance over the 
5 block, var(p,q), is greater than a second prescribed threshold t2v, which is the second 
largest V threshold. An exemplary value of t2v is 15. 

If the test result in step 429 is YES, indicating that the variance is large enough 
that the data was likely to have been encoded on the 4 th least significant bit of the integer 
portion of the average of the V values of the pixels of the block, control is passed to step 
10 431, in which variable m is set equal to 4. Control then passes to step 427, in which the 
value of the m ih least significant bit of the average of the V values of the pixels of the 
block is extracted as the value impressed upon this block. The process is then exited in 
step 459. 

If the test result in step 429 is NO, indicating that the variance was not large 
15 enough that the data was likely to have been encoded on the 4 th least significant bit of the 
integer portion of the average of the V values of the pixels of the block, control passes to 
conditional branch point 433, which tests to determine if the average Y variance over the 
block, var(p,#), is greater than a third prescribed threshold t3v 5 which is the smallest V 
threshold. An exemplary value of t3v is 7. 
20 If the test result in step 433 is YES, indicating that the variance is large enough 

that the data was likely to have been encoded on the 3 rd least significant bit of the integer 
portion of the average of the V values of the pixels of the block, control is passed to step 
435, in which variable m is set equal to 3. Control then passes to step 427, in which the 
value of the least significant bit of the average of the V values over the pixels of the 
25 block is extracted as the value impressed upon this block. The process is then exited in 
step 459. 

If the test result in step 433 is NO, indicating that the variance is only large 
enough that the data was likely to have been encoded on the 2 nd least significant bit of the 
integer portion of the average of the V values of the pixels of the block, control is passed 
30 to step 437, in which variable m is set equal to 2. Control then passes to step 427, in 
which the value of the least significant bit of the average of the V values of the pixels 
of the block is extracted as the value impressed upon this block. The process is then 
exited in step 459. 

If the test result in step 421 is that countU is greater than countV, control passes to 
35 conditional branch point 445, which tests to determine whether the average Y variance 
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over the block, \zx(p s q), is greater than a first prescribed threshold tlu, which is the 
largest U threshold. An exemplary value of tlu is 600. 

If the test result in step 445 is YES, indicating that the variance is large enough 
that the data was likely to have been encoded on the 5 th least significant bit of the integer 

5 portion of the average of the U values of the block, e.g., the value of 
int[sumU(p,q)/(number of pixels per block)], e.g., int[sumU(p,q)/64], control is passed to 
step 445, in which variable m is set equal to 5. Control then passes to step 447, in which 
the value of the least significant bit of the average of the U values over the pixels of 
the block is extracted as the value impressed upon this block. The process is then exited 

10 in step 459. 

If the test result in step 445 is NO, indicating that the variance was not large 
enough that the data was likely to have been encoded on the 5 th least significant bit of the 
integer portion of the average of the U values of the pixels of the block, control passes to 
conditional branch point 449, which tests to determine if the average Y variance over the 

15 block, var(p,#), * s greater than a second prescribed threshold t2u, which is the second 
largest U threshold. An exemplary value of t2u is 15. 

If the test result in step 449 is YES, indicating that the variance is large enough 
that the data was likely to have been encoded on the 4 th least significant bit of the integer 
portion of the average of the U values of the block, control is passed to step 45 1 , in which 

20 a variable m is set equal to 4. Control then passes to step 447, in which the value of the 
m ih least significant bit of the average of the U values of the pixels of the block is 
extracted as the value impressed upon this block. The process is then exited in step 459. 

If the test result in step 449 is NO, indicating that the variance was not large 
enough that the data was likely to have been encoded on the 4 th least significant bit of the 

25 integer portion of the average of the U values of the pixels of the block, control passes to 
conditional branch point 453, which tests to determine if the average Y variance over the 
block, var(p,q), is greater than a third prescribed threshold t3u, which is the smallest U 
threshold. An exemplary value of t3u is 7. 

If the test result in step 453 is YES, indicating that the variance is large enough 

30 that the data was likely to have been encoded on the 3 rd least significant bit of the integer 
portion of the average of the U values of the pixels of the block, control is passed to step 
455, in which variable m is set equal to 3. Control then passes to step 447, in which the 
value of the m th least significant bit of the average of the U values of the pixels of the 
block is extracted as the value impressed upon this block. The process is then exited in 

35 step 459. 
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If the test result in step 453 is NO, indicating that the variance is only large 
enough that the data was likely to have been encoded on the 2 nd least significant bit of the 
integer portion of the average of the U values of the pixels of the block, control is passed 
to step 457, in which variable m is set equal to 2. Control then passes to step 447, in 
5 which the value of the ml* 1 least significant bit of the average value of the U values of the 
pixels of the block is extracted as the value impressed upon this block. The process is 
then exited in step 459. 

Note that although the use of 3 thresholds and 4 bit positions has been shown in 
FIGs. 3 and 4, those of ordinary skill in the art will readily be able to adapt the indicated 

10 method to other numbers of thresholds and encoded values. 

Similarly, not all blocks of each frame or field of the video signal need be 
impressed with additional information. 

FIG. 6 shows an exemplary process for determining which particular chrominance 
portion is more suitable, and so should be selected, to contain the watermarking 

15 information for a pixel, in accordance with the principles of the invention. The process is 
entered in step 601 when it is necessary to select a chrominance portion to contain 
watermarking information. For purposes of discussion of FIG. 6, it is assumed that the 
pixel is represented in YUV format. Furthermore, it is noted that, preferably, for each 
original 2x2 luminance block of original video, had the original video been in 4-4-4 

20 representation, there should only one Y value for each chrominance component, i.e., each 
pair of respective corresponding U and V values. To this end, the Y values of the original 
block may be downsampled so as to have the same resolution as the U and V. 
Alternatively, the average, or some other combination, of the Y values associated with a 
particular U and V values may be computed and used as the Y value for the process of 

25 FIG. 6. 

Conceptually, in accordance with the principles of the invention, each position in 
a three-dimensional YUV colorspace corresponding to a possible pixel position, given the 
full range that a pixel's Y, U, and V values can take, is assigned a chrominance portion, 
e.g., based on experimental observations, that is more suitable, and so should be selected, 

30 for a pixel having such Y, U, and V values. If a version of the entire table for each 
possible set of Y, U, and V values was to be employed, where each of Y, U, and V has a 
full range of 8 bits, at least 16M bit of information would need to be stored, assuming that 
only one bit was stored for each position to indicate the selected chrominance portion. 
Note that use of a single bit only permits selection of U or V, but not a designation that 

35 neither U nor V should be employed. If it were desired to be able to select neither U nor 
V, 32 Mbits of information would be necessary. 



D:\PATENTS\Zarrabizadeh 24\Zarrabizadeh 24.doc 



Zarrabizadeh 24 

A cutaway view of a portion of exemplary assignments of a chrominance portion 
that is to be selected for each possible pixel within a three-dimensional YUV colorspace 
is shown in FIG. 7. Note that FIG. 7 is provided for pedagogical purposes only, as a 
conceptualization visual aid, and does not represent actual data. 
5 In order to reduce the storage requirements, the YUV colorspace may be 

considered to be a group of regions, each region being defined to include the positions 
corresponding to at least one set, and typically multiple sets, of Y, U, and V values, i.e., 
the positions in the colorspace corresponding to at least one pixel, and possibly many 
pixels, and each region, and hence each pixel which maps to that region, is assigned a 

10 chrominance portion, e.g., based on experimental observations, that is to be selected for 
any pixel whose set of Y, U, and V values fall within the region. One way to look at such 
grouping into regions is a quantization, which may be linear or nonlinear. 

Table 1 is a listing for an exemplary colorspace selection table, where each region 
corresponds to 4 Y values, 4 U values, and 4 V values, and hence to 64 possible 

15 combinations of 8 bit values for any pixel. Using such a table reduces the required 
information to be stored down to 256 Kbits, assuming that only one bit was stored for 
each position, or 512 Kbits, assuming it were desired to be able to select Y, V, and 
neither U nor V. Table 1 may be stored in any computer readable medium, e.g., ROM, 
RAM, magnetic storage such as a hard disk or tape drive, optical storage such as a CD- 

20 ROM or DVD-ROM, or the like. 

Those of ordinary skill in the art will readily recognize that the values employed 
in Table 1, which are for each of Y, U, and V having a full range of 8 bits, may be scaled 
for use with 10 bit Y, U, and V values by dividing by 4, e.g., shifting each 10 bit value to 
the right two times. Likewise, other numbers of bits used for Y, U, and V can be 

25 similarly accommodated. 

In order to effectively arrange and access the data of Table 1 , it is arranged so that 
the specified U or V selection, where 1 indicates select U and 0 indicates select V, for 8 
adjacent regions having the same U and V quantized values but different sequential 
quantized Y values, are grouped together to form a byte. Thus, for each U and V value 

30 there are 8 bytes, each corresponding to a region having the same U and V quantized 
values but different quantized Y values. 

Table 1 is arranged to be addressed using an address that has the most significant 
bits corresponding to the U values, the next least significant values corresponding to the 
V values, and the least significant values corresponding to the Y values. In other words, 

35 the address of the bytes may be formed as follows: 

U7|U6|U5|U4|U3|U2|V7|V6|V5|V4|V3|V2|Y7|Y6|Y5 
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where U7,U6,U5,U4,U3, and U2 are the values of the 8 th to 3 rd least significant 
bits of the pixels U value, V7,V6,V5,V4,V3, and V2 are the values of the 8 th to 3 rd least 
significant bits of the pixels V value, and Y7,Y6, and Y5, are the values of the 8 th to 6 th 
least significant bits of the pixels Y value. Then, the particular bit within the byte is 
5 specified by using the 5 th to 2 nd least significant bits of the Y component, e.g., Y4,Y3, and 
Y2. 

A table such as Table 1 is reflective of the facts that the human visual system is 
a) less sensitive to the blue color and b) more sensitive to lower luminance values. Such 
a table may be developed by trial and error, generally as follows. 

10 The colorspace is examined in sections, each section being defined by a 

luminance value and ranging in a first dimension corresponding to a first chrominance 
portion changing from its minimum value to its maximum value and in the second 
dimension corresponding to the second chrominance portion changing from its minimum 
value to its maximum value. Any or all of the luminance and the chrominance portions 

15 may be quantized, e.g., using the 6 most significant bits of 8 bit values. Doing so creates 
a set of planes having a checkerboard of chrominance portion values, which appears when 
displayed as blocks of different colors, one plane for each luminance value. For example, 
quantizing so as to use the 6 most significant bits of 8 bit values for the luminance and 
both chrominance portions yields 64 planes that correspond to each possible quantized 

20 luminance value, and each plane has a checkerboard pattern of colored boxes, with 64 
boxes vertically and 64 boxes horizontally for a total of 4096 boxes per plane. 

Each plane is examined separately. Random data is developed for a number of 
frames sufficient to be confident that the random data will have different values in like 
positioned blocks of the frame over time and for an observer to detect flicker should it 

25 appear. Thirty seconds or longer have proven to be of value. The random data is 
impressed upon frames that contain the plane, but only on a first one of the chrominance 
portions, e.g., using the system of FIG. 1 and the process of FIG. 3 to accomplish the 
watermarking but forcing the color selection to be the first chrominance portion. The 
resulting watermarked version of the frames is displayed and observed. 

30 Any block for which no flicker is observed is indicated in the table that its 

combination of luminance and chrominance portions should employ the chrominance 
portion currently carrying the watermark data as the selected chrominance portion for that 
combination. Any block for which flicker is observed is indicated in the table that its 
combination of luminance and chrominance portions should employ the chrominance 

35 portion that is not currently carrying the watermark data as the selected chrominance 
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portion for that combination. The process is repeated for the plane but changing the 
chrominance portion that is watermarked. 

For any block of a plane that flicker occurs for both chrominance portions, as can 
happen, the implementer may choose which chrominance portion should be selected. For 
example, U may be chosen because the human visual system is generally less sensitive to 
blue. Alternatively, the chrominance portion that would provide for better data 
compression of the resulting table may be employed. Similarly, where flicker does not 
appear on either block, the choice of the chrominance portion to employ is at the 
discretion of the implementer. 

The process is repeated for each plane until the entire table is populated. 
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Step 603 begins the process of accessing the information when so arranged. More 
25 specifically, in step 603, 

and 

V (M) _ yiPA) » 7 

V (M) V {iJ) >>Z 

30 are calculated, 

wherein, similar to that described hereinabove, 

p points to the particular horizontal slice of the frame is being processed and q 
points to the particular column, or vertical slice, of the frame, i points to the particular 
row within the block that is being processed,; points to the particular column within the 
35 block that is being processed, and "»" is a right shift operation. Doing so leaves only 
the desired 8 th to 3 rd least significant bits of the pixels U value, the 8 th to 3 rd least 
significant bits of the pixels V value, and the 8 th to 6 th least significant bits of the pixels Y 
value. Thereafter, in step 605 the lookup table address for the current pixel is calculated 
as 

40 Lm_Adress\$=u%$<<9+ where "«" is a left-shift 

operation. 

Doing so combines the extracted bits into a combined address and points to the 
one byte that corresponds to the pixel. Thereafter, in step 607, the particular bit within 
the byte that corresponds to the pixel is determined, by using the value made up of the 2 n 
45 to 5 th least significant bits of the Y component as an index into the byte. To this end, step 

607 calculates 

6=mod(^f «2,8) 

where mod is the modulo function. 
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In step 609, the value of the 6 th bit position of the byte at the calculated lookup 
table address is extracted, assigned as the value of a variable m, which is supplied as an 
output. Again, in this exemplary embodiment, if the extracted bit is a 1, U is the selected 
chrominance portion while if the extracted bit is a 0, V is the selected chrominance 
5 portion. 

The process then exits in step 611. 

Those of ordinary skill in the art will readily recognize how to adapt the foregoing 
to pixels in other formats, e.g., RGB or YIQ, 

Note that if Huffman encoding of the table is desired, it may be advantageous that 
10 the forgoing correspondence of select U being a 1 and select V being a zero should be 
reversed, assuming, as has been seen experimentally, that U is selected for a majority of 
pixel combinations. 

FIG. 8 shows another exemplary process by which the particular chrominance 
portion is selected to contain the watermarking information for a pixel, in accordance 

15 with the principles of the invention. The process is entered in step 801 when it is 
necessary to select a chrominance portion suitable to contain watermarking information. 
As in FIG. 6, for purposes of discussion of FIG. 8, it is assumed that the pixel is 
represented in YUV format. Furthermore, it is noted that, preferably, for each original 
2x2 luminance block of original video, had the original video been in 4-4-4 

20 representation, there should only one Y value for each chrominance component, i.e., each 
pair of respective corresponding U and V values. To this end, the Y values of the original 
block may be downsampled so as to have the same resolution as the U and V. 
Alternatively, the average, or some other combination, of the Y values associated with a 
particular U and V values may be computed and used as the Y value for the process of 

25 FIG. 8. 

In order to further reduce the storage requirements in the embodiment of FIG. 8, 
as compared to the embodiment of FIG. 6, in accordance with an aspect of the invention, 
not only is the YUV colorspace divided into regions, each region including positions 
corresponding to at least one set of Y, U, and V values, with each region being assigned a 

30 chrominance portion, e.g., based on experimental observations, that is to be selected for 
any pixel whose Y, U, and V values fall within the region, as described in connection 
with FIG. 6, but any pixel that has a U value less than a predefined value, e.g., one-half 
the maximum value, has the U chrominance portion selected for watermarking. Thus, for 
8 bit Y, U, and V, values, if the value of U is less than 128, the U chrominance portion is 

35 always selected for watermarking regardless of the values of V or Y. This is because 
human visual system is less sensitive to the blue component U than the V component. 
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In accordance with an aspect of the invention, by having the most significant 
address bits of the chrominance portion selection table correspond to the U-value-derived 
bits of the address, advantageously, the size of the table can be reduced by up to one half 
This is achieved by adding a test to determine if the U value is less than one half the 

5 maximum value prior to forming the table address, and if the test result is YES, simply 
indicating to select the U chrominance portion and skipping the rest of the process of 
accessing the table, and also by subtracting one half the maximum U value from the 
actual U value prior to calculating the U-value-derived bits of the address. Thus, the 
section of the table employed for FIG. 6 corresponding to the most significant U bit being 

10 0 is eliminated, and only that portion of the table where the most significant U bit is 1 is 
retained. However, the indexing into the remaining portion of the table is shifted by the 
subtraction from the U value of the one half of the maximum U value prior to forming the 
U-value-derived bits. 

Thus, the table is arranged to be addressed using an address that has the most 

15 significant bits corresponding to the U values, the next least significant values 
corresponding to the V values and the least significant values corresponding to the Y 
values. In other words, the address of the bytes may be formed as follows: 
U6|U5|U4|U3|U2|V7|V6|V5|V4|V3|V2|Y7|Y6|Y5 

where U6,U5,U4,U3, and U2 are the values of the 7 th to 3 rd least significant bits of 
20 the pixels U value, V7,V6,V5,V4,V3, and V2 are the values of the 8 th to 3 rd least 
significant bits of the pixels V value, and Y7, Y6, and Y5, are the values of the 8 th to 6 th 
least significant bits of the pixels Y value. Then, the particular bit within the byte is 
specified by using the 5 th to 2 nd least significant bits of the Y component, e.g., Y4, Y3, 
and Y2. 

25 To this end, conditional branch point 802 tests to determine if 

U\?$ < predefined _value, where predefined _value is, for example, one half the 

maximum U value. Note that to save a bit, and half the table size, preferably 
predefined_value should be a power of 2. If the test result in step 802 is NO, indicating 
that the value of U is less than the predefined value, e.g., one half the maximum value of 
30 U, e.g., 128, and hence the chrominance portion to be selected will be a function of Y, U, 
and V, and so the table must be accessed, control passes to step 803 to begin the process 
of accessing the table. In step 803, 

M S? = ( U (U1 - Predefined value) » 2 , e.g., ^ = {U { { $ - 1 28) » 2 
35 and 



D:\PATENTS\Zarrabizadeh 24\Zarrabizadeh 24.doc 



Zarrabizadeh 24 



v (p*q) _ y(p>i) ^ 9 
V VJ) ~ y (U) >>L 

are calculated, 

where, similar to that described hereinabove, 

where p points to the particular horizontal slice of the frame is being processed 
5 and q points to the particular column, or vertical slice, of the frame, i points to the 
particular row within the block that is being processed, j points to the particular column 
within the block that is being processed, and "»" is a right-shift operation. Doing so 
leaves only the desired 7 th to 3 rd least significant bits of the pixels U value, the 8 th to 3 rd 
least significant bits of the pixels V value, and the 8 th to 6 th least significant bits of the 
10 pixels Y value. Thereafter, in step 805 the lookup table address for the current pixel is 
calculated as 

LUT _ Address™ = 9 + v ( ( ™> « 3 + y\$ , where "«" is a left- 

shift operation. 

Doing so combines the extracted bits into a combined address and points to the 
15 one byte that corresponds to the pixel. Thereafter, in step 807, the particular bit within 

th 

the byte that corresponds to the pixel is determined, by using the value made up of the 5 

to 2 nd least significant bits of the Y component as an index into the byte. To this end, step 

807 calculates 

6==mod(^f<<2,8) 

20 where mod is the modulo function. 

In step 809, the value of the 6 th bit position of the byte at the calculated lookup 
table address is extracted and stored in the variable m. The value of variable m is 
supplied as an output in step 811. Again, if the output bit is a 1, U is the selected 
chrominance portion while if the extracted bit is a 0, V is the selected chrominance 

25 portion. The process then exits in step 813. 

If the test result in step 802 is YES, indicating that the U chrominance portion 
should be selected, because the pixel color is not primarily blue and hence changing the 
blue color of the pixel will not be detected by the human visual system, control passes to 
step 815, in which the variable m is set equal to 1. Doing so assures that U is selected. 

30 Control then passes to step 8 1 1 , and the process continues as described above. 

Notwithstanding the foregoing improvements in color selection, with certain Y, 
U, and V values for a pixel, there is still, disadvantageous^, a possibility that a slightly 
detectable flickering be manifest. This is because in order to survive MPEG-like 
encoding there may be a need to add large values to the average value of the selected 

35 chrominance portion. 



D:\PATENTS\Zarrabizadeh 24\Zarrabizadeh 24.doc 



44 



Zarrabizadeh 24 

FIG. 9 shows an exemplary transmitter arranged in accordance with the principles 
of the invention, and in which the flickering may be reduced by replicating the data to be 
impressed, at least once, and preferably two or more times, prior to its being impressed 
upon the average value of a chrominance portion of a block. The original and each 
5 replica are transmitted in the same block position of separate consecutive frames. 
Preferably, the frames having like-positioned blocks carrying the same data are 
consecutive in display order. Furthermore, specific blocks of the frame may be embedded 
with a particular known data sequence, e.g., a Barker sequence, rather than encoded user 
data. 

10 The embodiment of the invention in FIG. 9 is similar to that of FIG. 1. All 

like-numbered elements of FIG. 9 operate substantially the same as in FIG. 1. In addition 
to those elements of FIG. 1 that are shown in FIG. 9 are repeater 925 and optional 
sequence adder 927. In addition, bit mapper 123 of FIG. 1 is optionally replaced in FIG. 
9 by bit mapper 923. Replacement of bit mapper 123 by bit mapper 923 is necessary only 

15 if the additional functionality described hereinbelow in connection with bit mapper 923 is 
desired. 

Repeater 925 receives bits either from block interleaver 121 or optional sequence 
adder 927. Repeater 925 stores the received bits and outputs them for like-positioned 
blocks of at least two frames. In one embodiment of the invention, it has been found that 

20 good results are achieved when repeater 925 stores the received bits and outputs them for 
the like-positioned blocks of three frames. Those of ordinary skill in the art will be able 
to trade-off any perceived flicker with desired throughput of the watermark data by 
choosing the number frames for which the data is repeated. 

Optional sequence adder 927 embeds a particular known data sequence, e.g., a 

25 Barker sequence, in specific blocks of the frame, the data sequence being in lieu of 
encoded user data. The specific blocks in which the data sequence is encoded may be 
scattered throughout the blocks of a frame. Each group of initial and repeated data frames 
may employ a different known sequence. Doing so will enable the receiver to detect the 
grouping of the frames. Alternatively, the same sequence may be employed for each 

30 group but the specific blocks used for the sequence may be different for consecutive 
groups. 

FIG. 10 shows an exemplary embodiment of a receiver arranged in accordance 
with the principles of the invention for use in receiving a watermarked video signal, such 
as that produced by the transmitter of FIG. 9. The embodiment of the invention in FIG. 
35 10 is similar to that of FIG. 2. All like-numbered elements of FIG. 10 operate 
substantially the same as they do in FIG. 2. In addition to those elements of FIG. 2 there 
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are shown in FIG. 10 sequence processor 1025 and frame weighting unit 1027. 
Furthermore, channel decoder 221 of FIG. 2 is optionally replaced in FIG. 10 by channel 
decoder 1021. 

A receiver, e.g., as shown in FIG. 10, may detect group synchronization using 

5 sequence processor 1025. This may be performed by adding up the values of the group 
identification sequence from each frame of a group-length-number of consecutive frames, 
which thus are employed as a synchronization pattern, and determining if the result 
exceeds a prescribed threshold. If the threshold is exceeded, it is assumed that the first 
frame whose expected synchronization pattern value was added is the first frame in the 

10 group. If the threshold is not exceeded, it is assumed that the first frame whose value was 
added is not the first frame of a group. This is analogous to performing an 
autocorrelation on the synchronization pattern. Those of ordinary skill in the art will 
recognize that other conventional techniques for avoiding false matches, as well as 
handling missing the first frame due to errors, such as searching for a maximum prior to 

15 declaring group synchronization, may be employed. 

Advantageously, once the receiver detects the regular group pattern, any time 
there is a deviation from the pattern the receiver will be able to recognize that a frame of 
the original video sequence has been removed. Such information may be supplied as an 
output by sequence processor 1025. 

20 For example, various commercials of a vendor within a video signal may be 

monitored. The vendor may be assigned a unique code that is embedded in each frame of 
its commercial. A receiver is made aware of the particular unique code and which blocks 
of the watermarked frames should contain the code. By detecting the appearance of the 
code within watermarked frames, the receiver can identify a frame as being one that 

25 belongs to one of the commercials of the vendor. Once a frame with the code is detected, 
the number of sequential frames incorporating the code can be counted to determine the 
length of the commercial. If the number of frames counted is less than the anticipated 
number of frames based on the known length of the commercial when it was originally 
watermarked, it may be assumed that the commercial was inappropriately shortened by 

30 removing the number of frames that corresponds to the difference between the anticipated 
number of frames and the counted number of frames. Those of ordinary skill in the art 
will recognize that other conventional techniques for avoiding false matches, as well as 
handling missing the first frame due to errors, may be employed. 

Each frame of the commercial, or groups of frames within the commercial, may 

35 be watermarked with a unique identifier, e.g., a frame or group number, which is part of a 
distinct sequence over the frame. When a gap in the expected sequence is detected due to 
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one or more missing frames, the missing frames may be specifically identified when each 
frame has a unique identifier. When identifiers are assigned only to groups and the 
number of frames in each group is known, only the particular group to which any missing 
frames belongs may be identified, along with the count of how many frames are missing. 

5 Although replication of the data may be employed to reduce flicker, as indicated 

hereinabove, doing so may limit the ability to detect missing frames to merely identifying 
the group from which the frame is missing, rather than being able to identify the 
particular frame. Therefore, although the watermark data is generally replicated, at least 
an individual frame identifier may not be replicated. The blocks containing such non- 
10 replicated frames are placed where they will be least likely to attract attention should they 
cause flickering, e.g., the corners of the frame. Doing so provides the majority of the 
benefit of reducing detectable flicker, while also allowing particular individual frames 
that are missing to be detected. 

If a vendor has different commercials, each of the commercials may have a further 

15 sequence embedded in at least one of its frames to identify the particular commercial of 
that vendor that is being received. 

Should multiple vendors have watermarked commercials, so long as each vendor 
is assigned a unique code, a system monitoring for the appearance of the commercials of 
a first vendor with a first unique code will ignore commercials of a second vendor with a 

20 second unique code. Alternatively, a single system may monitor a video signal for the 
appearance of commercials from different vendors that each have a unique code, and the 
results may be segregated by vendor based on their codes. 

In another arrangement in which multiple vendors have watermarked 
commercials, each vendor employs the same code, and the code may even be at the same 

25 block locations within the frame for each vendor. However, all the subsequent data 
contained within the frame is encrypted using a unique key for each vendor and each 
vendor has a receiver that knows only the key for that vendor. Therefore, each vendor 
can only decrypt and receive data from its own commercials. In another arrangement, the 
data for each vendor may be encrypted by scrambling the data over the blocks of a frame. 

30 Each receiver would know only the scrambling pattern for its associated vendor. 

Monitoring for an initial appearance of a code indicating the start of a commercial 
may be performed continuously, or within a window of time during which the 
commercial is expected to be broadcast. 

In accordance with an aspect of the invention, instead of simply repeating the data 

35 over the multiple frames of a group and then using bit mapper 123 (FIG. 1), the amount 
added to the average value of a chrominance portion of a block, which depends on the 
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complexity of the block and its anticipated quantization level, may be changed slightly 
from frame to frame over a group, even when the complexity of the block is the same at 
corresponding locations from frame to frame. The change that is made is small with 
respect to the value being added to the average to place the watermark bit within the 

5 average value. Such changes may be performed by bit mapper 923 (FIG. 9), thereby 
providing additional coding gain that may be advantageously employed to improve the 
reliability of the data at the receiver. However, doing so may cause a slight reduction in 
visual quality of low texture areas, because a few pixels within the block may have 
different values than their predecessors in the same location. However, because such 

10 reduction is at the pixel level, it is typically not noticeable. 

In one arrangement, groups of three time-consecutive frames are transmitted with 
the same watermark data being impressed thereon. The middle frame of the group is 
watermarked as described above in connection with FIG. 3, without changing the amount 
added to the average value of the selected chrominance portion of the block from the 

15 value determined in FIG. 3. 

The first-in-time frame of the group also has a value computed to be added, i.e., 
an offset bias, by bit mapper 923 (FIG. 9), to the average value of the selected 
chrominance portion of the block that is developed as described in connection with 
FIG. 3. However, the bias, e.g., one quarter or, preferably one half, of the absolute value 

20 of the value being added to the average to place the watermark bit within the average 
value, is additionally added to the computed average value of the chrominance portion 
selected to carry the watermark data. Thus, for example, if one is being added to the 
average value to place the watermark bit within the average value, then one half is added 
to the average value. This translates to adding 32 to the sum of the values of the selected 

25 chrominance portion of all the pixels of the block when there are 64 pixels in a block. 
Thus, summer 133 will received a higher value than it would have had the bias not been 
added. Similarly, as another example, if -4 is being added to the average value to place 
the watermark bit within the average value, if one half of the absolute value of the value 
added to the average value is employed, this translates to adding 128 to the sum of the 

30 values of the selected chrominance portion of all the pixels of the block when there are 64 
pixels in a block. 

Note that this additional bias amount, e.g., 32, will be distributed throughout the 
various pixels based on their luminance variances. Also, this addition of the bias is 
independent of any value added to the average to bring it with a safe range. As a result, 
35 the average value may fall outside of the safe range. However, the increase in error 
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probability engendered by moving out of the safe range is more than offset by the 
resulting coding gain resulting from employing the bias. 

The last-in-time frame of the group has a value computed to be subtracted, i.e., an 
offset bias, by bit mapper 923 (FIG. 9), from the average value of the selected 

5 chrominance portion of the block that is developed as described in connection with FIG. 
3. However, the bias, e.g., one quarter or, preferably one half, of the absolute value of the 
value being added to the average to place the watermark bit within the average value, is 
additionally subtracted from the computed average value of the chrominance portion 
selected to carry the watermark data. Thus, for example, if -3 is being added to the 

10 average value to place the watermark bit within the average value, then one half of the 
absolute value of -3, i.e., 1.5, is subtracted from the average value. This translates to 
subtracting 96 from the sum of the values of the selected chrominance portion of all the 
pixels of the block when there are 64 pixels in a block. Thus, summer 133 will received a 
lower value than it would have had the bias not been subtracted. Similarly, as another 

15 example, if 2 is being added to the average value to place the watermark bit within the 
average value, then one half of the absolute value of 2, i.e., 1, is subtracted from the 
average value. This translates to subtracting 64 from the sum of the values of the selected 
chrominance portion of all the pixels of the block when there are 64 pixels in a block. 

Note that the loss of the subtracted bias amount, e.g., 32, will be distributed 

20 throughout the various pixels based on their luminance variances. Further note that this 
subtraction of bias is independent of any value added to the average to bring it with a safe 
range. As a result, the average value may fall outside of the safe range. However, the 
increase in error probability engendered by moving out of the safe range is more than 
offset by the resulting coding gain. 

25 One way to think about how this works is to look at FIG. 5. As described 

hereinabove, without considering the bias amount oftentimes just enough is added to, or 
subtracted from, the average value of the selected chrominance portion of a block in order 
to reach one of the outer borders of the safe range. Thus, prior to any bias, many frames 
are on or near the border of the safe range. The middle frame to which nothing is added 

30 or subtracted remains right on the border. The frame to which a slight bias is added may 
move slightly to be better positioned within the safe range, or it may move slightly out of 
the safe range. The frame from which a slight bias is subtracted moves in the opposite 
direction as the frame to which the bias is added. Thus, in the worst case, for a group of 
three frames one will be within the safe range, one will be on the border of the safe range, 

35 and one will be slightly out of the safe range. This results in an independent spread of 
values. 
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The effect of the bias may be further magnified because of the quantization that is 
performed by MPEG-like encoding and the separate MPEG-bias that is added during the 
MPEG dequantization. This can result in significant differences in the received data 
values for like-positioned blocks within consecutive frames even when the same bit is 
transmitted over those consecutive frames. 

At the receiver, e.g., as shown in FIG. 10, the data extracted from each frame is 
weighted appropriately using maximum ratio combining based on a quality level that is 
believed to be present for each frame, e.g., in frame weighting unit 1027. To this end, 
sequence processor 1025 may supply to frame weighting unit 1027 a) frame 
synchronization information, so that frame weighting 1027 can know which frames are 
grouped together, and b) the number of errors in the synchronization pattern of each 
frame. The quality level is determined based on how many errors are believed to be in 
the received frame, which can be determined based on how many errors there are in the 
synchronization pattern that is expected for that frame, as extracted by sequence 
processor 1025. Table 1 shows a number of errors for each synchronization pattern and a 
respective weight that has been empirically derived to be appropriate for a frame with 
such a number of errors in its synchronization pattern. In other words, the values of the 
extracted data from each frame may be treated as soft data that is weighted by its 
associated weight as part of the combining process. 

Based on the weights, the multiple instances of the same data bit for 
corresponding block locations in successive frames are extracted and combined to form a 

single received bit. This may be achieved by computing 

,~n « w t bit,+w 7 bit 2 + wJ>it 3 , 

bit out=(2"-l)-*—j 2 J— 3 -, where 

(w, + w 2 +w> 3 ) 

bit out is the final output bit for the group of three frames; 

w h w 2 , and w 3 are the weights for each of the first, second and third in time 

frames; 

bitu bit!, and bih are the bits from the like-positioned block of the first, second 
and third in time frames; and 

n is the number of bits that the soft decoder input precision. 

To best make use of the soft information, channel decoder 1021 is a so-called soft 
decoder that employs soft data bits, i.e., data bits that are each represented as a non binary 
number the range of which depends on the soft decoder input precision. For example, an 
8 bit input precision soft decoder operates with values between 0 and 255. To this end 

w.bit, +w-)bit 7 + w,bit, ... .. , , 

the weighted average of the received hard bits, - L - ! 2 — , is multiplied by 
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2" - 1 , thereby converting the weighted average into a soft value of the appropriate 
precision that can be processed by the soft decoder. 

When the determined quality of a particular frame is below a prescribed threshold, 
it may be assumed that the particular frame does not contain any watermarking data and 
5 no data is extracted for that frame. 

Those of ordinary skill in the art will readily recognize that which frame has the 
value added, which has it subtracted and which has no change; whether addition and 
subtraction are both necessary; the number of frames in a group; and any rounding to be 
performed on the value to be added or subtracted or the resulting value are at the 
1 0 discretion of the implementer. 
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